CYCLIC BEHAVIOR OF MAXIMA 
IN A HIERARCHICAL SUMMATION 

SCHEME* 

M.A. Lifshits 

Abstract 

Cyclic behavior of maxima in a hierarchical summation scheme. Let i.i.d. 
symmetric Bernoulli random variables be associated to the edges of a binary 
tree having n levels. To any leaf of the tree, we associate the sum of variables 
along the path connecting the leaf with the tree root. Let M n denote the 
maximum of all such sums. We prove that, as n grows, the distributions of 
M n approach some helix in the space of distributions. Each element of this 
helix is an accumulation point for the shifts of distributions of M n . 

Key words: Hierarchical summation scheme, distribution of maximum, branching 
random walk, cyclic limit theorem. 

1 Branching random walks 

The study of extremal positions of branching random walk and branching Brow- 
nian motion is by now considered as a classical problem with first deep results 
obtained as early as in Hammersiey's work [9j. During last decade it regained a 
considerable popularity. New substantial advances were obtained but many ques- 
tions remain open. 

Let us shortly recall the notion of branching random walk, a very special case of 
which will be considered in this article. At initial (zero) time there is one particle 
located at zero. At time 1 the particle dies but gives birth to a point process 
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(configuration) of progeny that consists of a random number of particles (points 
on the real line) whose positions are, generally speaking, mutually dependent. 
Every new born particle also lives one unit of time and dies giving birth to a point 
process of progeny independent of all other analogous processes. The distribution 
of progeny process for every particle (ancestor) differs from the progeny process of 
initial particle by translation to the position of the ancestor. 

Therefore, the branching random walk is a genealogical Galton-Watson tree 
T, where every element u 6 T is additionally characterized by its position on the 
line V(u). Clearly, V(u) is a sum (over the set of ancestors of u), of independent 
random variables, each term of the sum being the displacement of a particle with 
respect to the location of its parent particle. 

We shall not consider variations of this basic model, e.g. those with random 
life time of each particle. 

Every particle u belongs to a certain generation i.e. to a level of the tree 
T. For the initial particle, we let the generation number be zero. 

The extremal positions in generations describing the generations' range are of 
special interest. They are defined by formulas 

M n := ma,x{V(u), \u\ = n}, m n := min{y(w), \u\ = n}. 

The limit theorems for the distributions of these variables are obtained in [TJ [2J El 
HI El El m E21 HD]. The y essentially assert that 

M n = cn + b n + M n , (1) 

where c is a non-negative constant, b n is a deterministic sequence varying slower 
than a linear function (most commonly, b n behaves logarithmically), and a sequence 
M n converges in distribution to some limit or is just bounded in probability. We 
stress that no multiplicative norming is needed, i.e. the family of distributions of 
the variables (M n ) n >o is shift-compact. 

Obviously, the linear term in the asymptotics of M n can be trivially eliminated 
by a constant shift of the progeny point process in the definition of the walk. 

The following recent theorem due to E. Aidekon |2j is one of the most repre- 
sentative and powerful results on the extremal positions. 

Theorem 1 Assume that the distribution of the progeny process in a branching 
random walk is non-lattice and that the following assumptions are satisfied, 



(2) 



e Yl e v(u) = i. E E ^ eV>) = °> ( 3 ) 

\H=i / \|«|=i / 

as we// as tae moment restrictions 

E |^E F(u) 2 e y ^j < oo, (4) 
E (X(\n+ X) 2 ) < oo, E (x(ln+ X)) < oo, (5) 

where X : = £ M==1 e ^«0, X := £ |u|=1 V(«)_e v W. 

XTien t/iere exists an a.s. positive random variable D such that for any rGi 
i£ ?s true tnat 

lim P ( M n < — lnn + r J = Ee^ 



Theorem [T] means that in (TjQ) we have c = 0, 6 n = — | In n, and the distributions 

of M n converge to a mixture of shifted double exponential distributions (Gumbel 
laws) . 

Assumption (j2J) is natural: it means that the branching process is supercritical. 
This condition provides sufficient number of particles in the walk. Assumptions 
([3]) mean that a linear scaling of the walk steps "killing" a linear term in ([1]) is 
performed. 

These assumptions are not too restrictive in the following sense. Consider 
a branching random walk satisfying condition fl2]) but not necessarily satisfying 
conditions ([3]). Let 

$( 7 ) := E I elV{u) I > ^(7) := ln$(7), 7 > 0. 
\M=i / 

Making a linear shift in one generation 

V(u) := jV(u) - ^(7), \u\ = l, 
corresponds to the shift of all particles 

V(u) :=yV(u) - M#( 7 ), mgT. (6) 
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Let us search for 7 > such that the analogues of ([3]) for the new walk 



E e eV{u) = !. E E = 0. 

\M=i / \M=i / 

would be valid. Note that the first equality holds automatically, since 

E ( E ) = e"*WE ( E ^ ) = e"* (7) $(7) = 1. 
\l«l=i / \m=i / 

We may rewrite the second condition as 

= E I E(7^H -^(7)) e7V(M) 
\l«l=i 

= 7 e I E v( u y v{u) ) - *( t )e I E elV[u 

\\u\=l J \|«|=1 

= 7 $'( 7 ) -*( 7 )$( 7 ), 
which is equivalent to 

i2( 7 ) := 7 *'( 7 ) - *( 7 ) = 0. (7) 

It follows easily from Holder inequality that the function is convex. There- 
fore, i?'( 7 ) = 7 \I/"( 7 ) > , i.e. is an increasing function. We have R(0) = 
-^r(O) = -ln$(0) < by (El). Hence, if 

$( 7 ) < 00, < 7 < 00, 

and 

lim R(j) = lim [ 7 *'( 7 ) - *( 7 )1 > 0, (8) 

7— >oo 7— >oo 

then equation ((7|) has a solution 7 > 0, and a liner change ([6]) reduces the study 
of the initial walk to the study of a walk satisfying assumptions ([3]). 

In the example we focus on below, no shift can render the distributions of M n 
convergent to a non-degenerate limit distribution. Instead, they approach some 
helix of distributions, or, if the shifts are allowed, they circulate along some closed 
curve in the space of distributions. There are two reasons preventing application of 
Theorem [T] in that case: first, the distributions of the progeny process is a lattice 
one; second, the reduction condition (jSJ) fails. 
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2 Hierarchical summation scheme 



In the following we consider the simplest model of a branching random walk: ev- 
ery particle produces two particles whose translations are independent Bernoulli 
random variables taking value 1 with probability p and —1 with probability 1 — p. 
Therefore, the genealogic tree T is just a simplest binary tree and the particles' lo- 
cations are described by the sums of independent Bernoulli random variables along 
the branches of this tree. It is amazing that such a simple model demonstrates an 
interesting limit behavior. 

We may redescribe the object under consideration as follows. 

Consider n- level binary tree and associate to its edges i.i.d. Bernoulli random 
variables (-B»). The tree has 2 n leafs. To each leaf we associate the sum of random 
variables picked up along the path connecting the leaf with the tree root. Let M n 
be the maximum of the sums along all leafs. We shall investigate the asymptotic 
behavior of the distribution of M n , as n goes to infinity. Clearly, we have, M = 0, 
M n e [— n, n], and M n = n (mod 2). Moreover, there is a recurrency equation 

M n+1 = max {M« + + B® } , (9) 

where Mn and B^ are independent copies of M n , resp. of the Bernoulli variable. 

It is worthwhile to notice that hierarchical summation schemes appear not 
only in connection to the branching walks. They emerge, for example, in physical 
models such as Derrida generalized random energy studied by Bovier and Kurkova 
[1]. In their setting, the summands situated on the same level of the tree have 
the same distribution but this distribution is allowed to vary reasonably from one 
level to another. 

2.1 Symmetric case 

In this subsection we consider the most interesting symmetric case 

P(B i = l)=P(S i = -l) = i. 

We start with the study of the behavior of EM n . Subsequent delicate consid- 
erations are entirely based on the following modest fact. 

Proposition 2 Let 

K n := {u : \u\ = n,V(u) = M n } 
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be the number of vertices of level n where the maximum M n is attained. Then 
K n — > oo in probability and 

lim E(M n+1 - M n ) = 1. 

Proof. Notice that K n is bounded from below by a critical Galton-Watson 
process Z n with the progeny number N defined by the law 

= 0, 

F(N = k) = {l k = l, 




and restarting from 1 at extinction time. To observe Z n on the tree, it is enough 
to keep track of the paths along which we have only Bi = +1; at each level, 
where we have extinction (the values —1 occupy all continuations of the paths we 
observe), we keep a single path and consider only its continuations - according to 
the previous rule. Remark that all chosen paths provide maximal values of sums 
on each level, hence Z n < K n . 

Look at Z n from the point of view of Markov chain theory. All states are 
recurrent and null, since the expectation of extinction time for our Galton-Watson 
process is infinite. Hence, for any fixed I G N 

lim F(Z n = £) = 0, 

n—too 

e.g. see Theorem 3 in 0, Section XIII. 3]. Hence, for any m G N 

m 

lim F(K n < m) < lim F(Z n < m) = V lim F(Z n = t) = 0, 

n— >oo n—too < * n— >oo 

1=1 

as claimed. 

Passing to the expectations, let us notice that M n+1 — M n G {—1, +1}; more- 
over, 

P(M n+ i -M n = +1| An) = l-2- K \ 
F(M n+l -M n = -l\A n ) = 2- K ", 

where A n stands for the sigma-field generated by the variables situated on first n 
levels of the tree. It follows that 

E(M n+1 -M n ) = l-2E2- K ", 
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and the second claim of the proposition follows from the first one. □ 

Proposition [2] shows that EM„ ~ n, as n grows to infinity. Hence, it suggests 
that M n is relatively close to its upper border n. Therefore, it is more convenient 



to consider the variables M' 



n—Mn 



. Then M' n is a non-negative integer random 



variable and satisfies the relations M' = 0, M' n e [0, n] and the equation 

MUi = mm {<« + S«; + B^) (10) 



where Mn {j) and are independent copies of M' n , resp. of a variable B having 
the distribution 

P(B = 1) = F(B = 0) = - . 

It is more convenient to express the recurrency equation in terms of the tails of 
random variables. Let F n (x) := P(M^ > x). Then 

V ; 10, x>0, 



and 



-^n+l(^) 



F n (x) + F n (x - V 



This equation has many invariant solutions. Indeed, an invariant solution should 
satisfy equations 



4F(x) = [F(x) + F{x - l)\ 



(12) 



Hence, F(x — 1) = G(F(x)) and F(x) = g(F(x — 1)), where G(y) := — y and 
g(y) := 2 — y — 2yT — y are mutually inverse functions. It follows that all values 
of F can be expressed via F(0) as iterations of functions g and G. The family of 
invariant distribution may be written in a parametric form {F a , < a < 1}, where 



#"(a), n > 0, 



0. 



n < 0. 



and g n , G n denote the n-th iteration of g, resp. G. It is clear that the family of 
invariant distributions form a continuous one-parametric curve (it is natural to call 
it a "helix") in the space of distributions M. (M 1 ); moreover, using the appropriate 
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shifts we can transform this curve into a closed cycle, i.e. JF 9 ^ a >{- — 1) = J- a {-) for 
any < a < 1. 

We consider now the limit behavior of F n (x) as n — > oo. Let us first handle 
the case of fixed x. The following is true. 

Proposition 3 For each x G Z it is true that lim^oo F n (x) = 1. 

Proof. Using induction in x, we derive from (fTTj) that the sequence F n (x) is 
non- decreasing in n for each fixed x. Hence, the limit F(x) := lim n F n (x) exists 
and satisfies equation (TT21) . Notice that F(x) = 1 for x < 0, and it follows from 
ffT2|) that F(x - 1) = 1 yields F(ar) = 1. Therefore, F(x) = 1 for each xeZ.D 

It is worthwhile to notice that the fist non-trivial 1, corresponds to 

the behavior of P(M„ < n), i.e to the extinction probability of a critical branching 
Galton- Watson process. It is well known from the classical works of R. Fisher and 
A.N. Kolmogorov that for such process extinction takes place almost surely. 

Proposition [3] implies that the variables M' n converge to infinity in probability. 

We pass now to the main result called a cyclic limit theorem. We will show 
that for large n the distribution F n admits an approximation by an appropriate 
invariant distribution. 

Theorem 4 For each n let a median k n be defined by the relation 

k n = inf{x G Z : F n {x) < 1/2}. 
Let a n = G kn (F n (k n )). Then 

lim sup \F n (x) - F an (x)\ = 0. 

Since F n goes along the limit helix (J ra )o<a<i with decreasing speed, it is natural 
to conjecture that all points of the helix are the limit points of F n after appropriate 
shift normalizations. In particular no shifts can render F n convergent to a non- 
degenerate distribution. The exact assertion is as follows. 

Theorem 5 For any a G (0, 1) there exist z G Z and a sequence of integers tending 
to infinity such that 

lim max \F a {x) — F nk (x + k — z)\ =0. 
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Proof of Theorem HI 

First, notice that the second claim of Proposition [2] may be also written in the 
form 

oo 

lim A n := lim E(i< +1 - M' n ) = lim Y[F n+1 {k) - F n {k)\ = 0. (13) 

n— >oo n— >oo n— >oo ' ' 

k=l 

Another necessary ingredient for the proof is the identity 

F n (x) = g(F n (x - 1) - 5) - 5, (14) 

where 5 = 8(n, x) := F n+ \(x) — F n (x) G [0, A n ]. 
Indeed, we may write (fTTj) as 



FJx) + 5 



F n (x) + 5 + F n (x - 1) - cP 2 



Taking into account that the function g satisfies the identity g(y) = ^£M±^j ) we 
arrive at (JHJ). Note for subsequent applications that ( 114"]) yields useful inequalities 



F n {x) < g{F n {x - 1)), G(F n (x)) < F n {x - 1). (15) 



Now we pass to the proof of the theorem. By using (1141) and the monotonicity 
of g(-), for each integer d > we obtain 

F n (k n + d) = g (F n (k n + d-l)-5)-5 

< g(F n (k n + d-l)) < ... < g d (F n (k n )). 

We also have 

F a -{k n + d)= g d (F a "(k n )) = g d {F n {k n )). 

For any e > 0, take a positive integer D such that g D {V) < e. Then for any d > D, 
by using monotonicity of g(-) and inequality g(y) < y, we infer 

max{F n (A; n + d); F a "(k n + d)} < g d (F n (k n )) < g d {\) < g D a) < e. 



Hence, 



max\F n (k n + d)-F an (k n + d)\<e. (16) 



Now we show by induction that for every d — 0, 1, . . . , D it is true that 

lim \F n (k n + d)- F a "(k n + d)\ = 0. (17) 
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We have chosen parameters a n so that 

F a "(k n ) = g k -(^(0)) = g k "(a n ) = g k " G k " (F n (k n )) = F n (k n ). 



Therefore, for d = the left hand side of (1171) vanishes, thus providing the induc- 
tion base. Assume that for d — 1 assertion ( TTT|) is proved, then by (jlip for d we 
have 

\F n (k n + d)- F an (k n + d)\ = \g(F n (k n + d-l)-6)- g(F a "(k n + d-l))\+8, 

where 5 := F n+ i(k n + d) — F n {k n + d) G [0, A n ]. It follows that 

\F n (k n + d)-F a "(k n + d)\ 
< [\F n (k n + d-l)-F a "{k n + d-l)\ + A n ] max \g\y)\ + A n , 

Since A n — > by (1131) . and since the function g' is bounded on [0, ~], we obtain 
that 

limsup \F n (k n + d) - F an (k n + d)\ 

n— ¥oo 

< lim \F n (k n + d - 1) - F an {k n + d - 1)| ■ max \g'(y)\ = 0. 

n.-s>oo 0<y<i 



Therefore, (IT7I) is proved. By combining (1161) with (IT7I) . we obtain 
lim max |F n (/c n + d) - F an {k n + d) \ = 0. 

n— >oo d>0 

Negative gPs are handled in the same way by using function G instead of g. □ 

Proof of Theorem [5]. Without loss of generality we may assume that | ^ 
{F a (x),x G Z}. Then there exists z£Z such that 

^(z-l)>i>^(z). 

Fix an e > and choose 5 G (0, min{a, 1 — a}) so small that b G (a — 5, a + 5) 
implies 

max\F b (x) - F a (x)\ < e. 
We may also require the inequalities 

F a -\z-\)> l -> F a+ \z). (18) 
10 



to hold. Take a positive integer n such that for all n > no it is true that A n < 
_ jca-s^ z y ^ now k e so i arge that F nQ (k) < F a ~ s (z). Consider the 

sequence f n := F n (k),n > n , for fixed k. By Proposition [3] it grows to one. Since 
f no < J ra_<5 (2;) and for all n > n it is true that 

f n+1 -f n = F n+l (k) - F n (k) <A n < F a+ \z) - F a - S (z), 

there exists n := satisfying 

F n (k) = f n e(T a -\z),T a+s (z)). 

Notice that k is the median for F n , since by (Fl8|) . ([151) 

F n (k) < F a+ \z) < l - ■ F n (k - 1) > G(F n (k)) > G(F a -\z)) = T a ~\z - 1) > 1 

Therefore, the approximating distribution J 7 "' 1 from Theorem H] satisfies the equal- 
ities 

F a "(A;) = F„(A;) = F fe (z) 

for some b G (a — 5, a + <5). Finally, we use the following fact: if J-^w) = J~ b (v) for 
some a, b G (0, 1) and some i,«/eZ, then for all x G Z we have 

•F^z + u-u) = J* 6 (x). 

In our case T an {x + k — z) = J- h {x) holds. Therefore, 

max \ J- a (x) — F n (x + k — z)\ 

x& 

< max \T a (x) - T h (x) \ + max I T h (x) - FJx + k — z)\ 

x& xGZ 

< e + max |J* a "(s + - z) - FJx + k - z)\. 

x&L 

Since e was chosen arbitrarily and the second term tends to zero by Theorem HJ 
we obtain the assertion of Theorem □ 

One of the reasons for non-existence of the unique limit distribution is the 
discrete type of Bernoulli distribution, as is clearly seen from Theorem [TJ Another, 
less obvious and may be a deeper, reason is the failure of ([8]). Indeed, in the 
hierarchical summation scheme for p-Bernoulli variables we have 

7*'(7)-*(7) = 7 P6 l " [? " P \ 6 Z ~ In 2 - In {pe< + (1 - p)e^) 

pe 1 + (1 — p)e 7 

2(l-p) 7( 



-ln(2p)- v (l + o(l)) 

p e Z-y 
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and 

lim [7*' (7) - #(7)] = -ln(2p). 

7— >oo 

Therefore, condition (jSJ) is satisfied iff p < |. 

The tree structure of the hierarchical summation scheme is not related to the 
helix-type behavior of the maxima distributions: one can obtain a similar result 
for conventional summation (see Section [3] below) . 

Remark. One can also derive Theorem H] from Theorem 1 in Bramson's work 
[5] . The additional advantages of his result are the more general branching rule 
and approximation in the sense of almost sure convergence. However, Theorem H] 
provides more transparent geometric picture of the phenomenon. 

2.2 A limit theorem for the case p > 1/2 

We maintain the notation of the previous subsection but assume now that 

F(Bi = 1) = 1 - F(Bi = -l)=p> 1/2 . 
Let q :— 1 — p. The recurrency equation now takes the form 

M' n+l = min + S«; M'^ + 5(2)} (19) 

where and B^ stand for independent copies of M' n and of a variable B that 
satisfies 

f(B = 1) = 1 - F{B = 0) = q. 

In terms of the distribution tails F n (x) := P(M^ > x), we obtain an equation 
analogous to ( TTT1) . namely, 

F n+1 (x) = [F n (x)p + F n (x - l)qf . (20) 

There is a big difference with respect to the previous case: now there exists a 
unique invariant non-degenerate solution satisfying the equation 

F{x) = [F{x)p + F(x - l)qf (21) 

and the initial condition F{x) = 1, x < 0. Namely, 

F(a?) = (2p 2 )" x 1 - 2F{x - l)pq - ^1 - AF(x - l)pq , x > 0. 

Therefore, it is not surprising that a limit theorem holds in this case. 
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Theorem 6 Uniformly over x G Z, the monotone convergence F n {x) /* F{x) 
holds. 

Proof. First, by induction in x we derive from (|20|) that the sequence F n (x) 
is non- decreasing in n for each fixed x. Therefore, the limit F(x) := lim n F n (x) 
exists and satisfies equation (|2"TT) . It remains to prove that it is non-degenerate, 
i.e. it is different from identical unit. For this purpose, it is enough to notice that 

F n (l) = P(M; > 1) = P(n - M n > 2) = P(M n < n - 2) 

coincides with extinction probability of the supercritical Galton- Watson process 
with the progeny number N defined by the law 







fc = 





W(N = k) = | 




k = 


1 




U 2 > 


k = 


2 



Therefore, 1 — F(l) is the survival probability of the process, which is strictly 
positive, as p > ~. □ 

2.3 Some results for the case p < 1/2 

In what concerns limit theorems, not more is known for this case than for the 
hierarchical summation scheme with general independent random variables having 
finite exponential moments. For p = P(B = 1) < 1/2 the equation fl2T|) has no 
non-trivial solutions, therefore, the behavior of maxima is completely different than 
in the previous cases - a drift with constant speed appears. Once we eliminate this 
linear drift, the distributions of M n form a dense set with exponentially decreasing 
tails. 

Recall that a family of random variables (X n ) is called shift- compact, if there 
exists a real sequence (a n ) such that the distributions of random variables X n — a n 
form a tight family on the real line, i.e. 

lim supP{|X n - a n \ > K} = 0. 

K— >oo n 



Proposition 7 Let p < 1/2. Then the sequence of random variables M n is shift- 
compact, while 

EM n ~ pn, n — > oo, 
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where the shift coefficient p is defined from equation 

2p p q 1 - p = p p (l- P y- p . (22) 

Proof. The result follows, e.g., from Theorem 1.1 in [7\. It is worthwhile to 
notice that the equation for the drift fl22|) is essentially the special case of equation 
([7j) providing reduction to the critical case. □ 



3 Cyclic theorems for maxima of independent 
sums 

Let (£i)i e N be integer i.i.d. random variables. Consider the sum S n := X)ILi£tj 
and let S„\ 1 < j < 2™, be independent copies of S n . We are interested in the 
behavior of M n := maxj< 2 n Sn- 

We will assume that our random variables satisfy 

E|£i|<oo and Eexpiofi} < oo, V7 > 0. (23) 

Let u be the upper bound of the distribution, 

u := sup{m G N : P(6 = m) > 0}. 

Assume that one of the two following assumptions is satisfied: either 

(i) u = 00, 

or 

(ii) oj < 00 and P (^1 = u) < 1/2. 
Since the cumulant 

L( T ) :=lnEexp{ 7 ^} 

is a convex function of 7, the function £(7) — 71/(7) is non-increasing. It is 
continuous and vanishes at 7 = 0. Moreover, if (1231) holds, and any of assumptions 
(i) or (ii) is satisfied, it is easy to show that 

hm [L(7)- 7 L'(7)] <ln(l/2). 

7— >+oo 

Therefore, a solution of equation 

L(7)-7L'( 7 )=ln(l/2) (24) 
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exists on (0, +00). Let denote it 7* and let p* := £'(7*). Notice also that under 
either (i) or (ii) the distribution of £j is non-degenerated (not concentrated at a 
single point), therefore the solution of (J21]) is unique. 

Theorem 8 Let (123]) and either (i) or (ii) holds. Let p*,7* be defined by equation 
(|24j). T/ien 

P |m„ < p , n - ^ + .I = exp {- e^-7,4(l + o(l)) | 
I 2 7* J I V27ro-(7*)(l -e-T*) J 

where o~(-) 2 = L"(-), uniformly ove$\ 

Inn 



z e 



Z - p*n + 



2 7 * 

for any bounded interval I. 

We can rewrite formula (|25|) as 

P{M n <m} = exp{-exp{-7,(m-a n )}(l + o(l))}, m G Z, (26) 

where 

ln[ v / 2^(x(7,)(l -e">)] 
a n := p^n . 

7* 

For each a G M let J-" a denote the distribution on integers given by 

.F a ((m, +00)) = exp {— exp{— 7*(m — a)}} , m G Z. 

Then (J-" a )aeR is a curve in the space of distributions. It is natural to perceive it as 
a helix, in view of 1-periodicity up to a shift: J-" a+1 {m+l} = J-" a {m}. Relation f l26|) 
shows that the distribution of r.v. M n admits the uniform approximation by the 
helix element J 7 " 1 ", while after appropriate centering it admits an approximation by 
the element of the helix turn (J ra )o< a <i- Moreover any distribution (J ra )o< a <i 
is a limit of some subsequence of centered distributions of M n . 

The proof of Theorem [U which is supposed to be published separately, is based 
on a large deviation theorem due to V.V. Petrov [131 Complement 2 in §4 Chapter 
VIII]. 



-'^In other words, we consider z such that the expression in the left hand side is an integer 
number. 
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Let us consider Bernoulli case as an example. Let = Bi be independent 
random variables having non-symmetric Bernoulli distribution, i.e. 



P(A = 1) = 1 - P(5 i = -1) = p < 1/2 . 

Let the drift coefficient p* be again defined by equation (1221) . We also need two 
auxiliary constants k := p ^~~ p *^ G (0, 1) and f3 := 27rp*(l — p*). Then the result of 
Theorem [8] takes the following form. 

Theorem 9 We have 

P |M n < p*n - |^ + zX = exp {-^r (1 + o(l))} , (27) 
uniformly over 



Z - p^n + 



2 Ihk 



/or any bounded interval I. 



Remark. For p > | neither of conditions (i), (ii) holds. Equation pljl has no 
solutions, thus Theorem [8] does not apply. 

The author is deeply indebted to Irina Kurkova and to Zhan Shi for interesting 
discussions, for providing important references, and, most of all, for motivation to 
write this article. 
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